180 research outputs found
Sequeval: A Framework to Assess and Benchmark Sequence-based Recommender Systems
In this paper, we present sequeval, a software tool capable of performing the
offline evaluation of a recommender system designed to suggest a sequence of
items. A sequence-based recommender is trained considering the sequences
already available in the system and its purpose is to generate a personalized
sequence starting from an initial seed. This tool automatically evaluates the
sequence-based recommender considering a comprehensive set of eight different
metrics adapted to the sequential scenario. sequeval has been developed
following the best practices of software extensibility. For this reason, it is
possible to easily integrate and evaluate novel recommendation techniques.
sequeval is publicly available as an open source tool and it aims to become a
focal point for the community to assess sequence-based recommender systems.Comment: REVEAL 2018 Workshop on Offline Evaluation for Recommender System
STEM: stacked threshold-based entity matching for knowledge base generation
One of the major issues encountered in the generation of knowledge bases is the integration of data coming from
a collection of heterogeneous data sources. A key essential task when integrating data instances is the entity matching. Entity
matching is based on the definition of a similarity measure among entities and on the classification of the entity pair as a match
if the similarity exceeds a certain threshold. This parameter introduces a trade-off between the precision and the recall of the
algorithm, as higher values of the threshold lead to higher precision and lower recall, and lower values lead to higher recall
and lower precision. In this paper, we propose a stacking approach for threshold-based classifiers. It runs several instances of
classifiers corresponding to different thresholds and use their predictions as a feature vector for a supervised learner. We show that
this approach is able to break the trade-off between the precision and recall of the algorithm, increasing both at the same time and
enhancing the overall performance of the algorithm. We also show that this hybrid approach performs better and is less dependent
on the amount of available training data with respect to a supervised learning approach that directly uses properties’ similarity
values. In order to test the generality of the claim, we have run experimental tests using two different threshold-based classifiers
on two different data sets. Finally, we show a concrete use case describing the implementation of the proposed approach in the
generation of the 3cixty Nice knowledge base
Sequeval: an offline evaluation framework for sequence-based recommender systems
Recommender systems have gained a lot of popularity due to their large adoption in various industries such as entertainment and tourism. Numerous research efforts have focused on formulating and advancing state-of-the-art of systems that recommend the right set of items to the right person. However, these recommender systems are hard to compare since the published evaluation results are computed on diverse datasets and obtained using different methodologies. In this paper, we researched and prototyped an offline evaluation framework called Sequeval that is designed to evaluate recommender systems capable of suggesting sequences of items. We provide a mathematical definition of such sequence-based recommenders, a methodology for performing their evaluation, and the implementation details of eight metrics. We report the lessons learned using this framework for assessing the performance of four baselines and two recommender systems based on Conditional Random Fields (CRF) and Recurrent Neural Networks (RNN), considering two different datasets. Sequeval is publicly available and it aims to become a focal point for researchers and practitioners when experimenting with sequence-based recommender systems, providing comparable and objective evaluation results
Predicting Your Next Stop-over from Location-based Social Network Data with Recurrent Neural Networks
In the past years, Location-based Social Network (LBSN) data have
strongly fostered a data-driven approach to the recommendation
of Points of Interest (POIs) in the tourism domain. However, an
important aspect that is often not taken into account by current
approaches is the temporal correlations among POI categories in
tourist paths. In this work, we collect data from Foursquare, we
extract timed paths of POI categories from sequences of temporally
neighboring check-ins and we use a Recurrent Neural Network
(RNN) to learn to generate new paths by training it to predict
observed paths. As a further step, we cluster the data considering
users’ demographics and learn separate models for each category
of users. The evaluation shows the eectiveness of the proposed
approach in predicting paths in terms of model perplexity on the
test se
Sequeval: A framework to assess and benchmark sequence-based recommender systems
In this paper, we present sequeval, a software tool capable of performing the offline evaluation of a recommender system designed to suggest a sequence of items. A sequence-based recommender is trained considering the sequences already available in the system and its purpose is to generate a personalized sequence starting from an initial seed. This tool automatically evaluates the sequence-based recommender considering a comprehensive set of eight different metrics adapted to the sequential scenario. Sequeval has been developed following the best practices of software extensibility. For this reason, it is possible to easily integrate and evaluate novel recommendation techniques. Sequeval is publicly available as an open source tool and it aims to become a focal point for the community to assess sequence-based recommender systems
Knowledge Graph Embeddings with node2vec for Item Recommendation
In the past years, knowledge graphs have proven to be beneficial
for recommender systems, efficiently addressing paramount issues
such as new items and data sparsity. Graph embeddings algorithms have
shown to be able to automatically learn high quality feature vectors
from graph structures, enabling vector-based measures of node relatedness.
In this paper, we show how node2vec can be used to generate item
recommendations by learning knowledge graph embeddings. We apply
node2vec on a knowledge graph built from the MovieLens 1M dataset
and DBpedia and use the node relatedness to generate item recommendations.
The results show that node2vec consistently outperforms a set
of collaborative filtering baselines on an array of relevant metric
An empirical comparison of knowledge graph embeddings for item recommendation
In the past years, knowledge graphs have proven to be beneficial
for recommender systems, efficiently addressing paramount issues
such as new items and data sparsity. At the same time, several works have
recently tackled the problem of knowledge graph completion through machine
learning algorithms able to learn knowledge graph embeddings. In
this paper, we show that the item recommendation problem can be seen
as a specific case of knowledge graph completion problem, where the
“feedback” property, which connects users to items that they like, has to
be predicted. We empirically compare a set of state-of-the-art knowledge
graph embeddings algorithms on the task of item recommendation on
the Movielens 1M dataset. The results show that knowledge graph embeddings
models outperform traditional collaborative filtering baselines
and that TransH obtains the best performance
An ensemble approach of recurrent neural networks using pre-trained embeddings for playlist completion
This paper describes the approach of the D2KLab team to the RecSys Challenge 2018 that focuses on the task of playlist completion. We propose an ensemble strategy of different recurrent neural networks leveraging pre-trained embeddings representing tracks, artists, albums, and titles as inputs. We also use lyrics from which we extract semantic and stylistic features that we fed into the network for the creative track. The RNN learns a probabilistic model from the sequences of items in the playlist, which is then used to predict the most likely tracks to be added to the playlist. Concerning the playlists without tracks, we implemented a fall-back strategy called Title2Rec that generates recommendations using only the playlist title. We optimized the RNN, Title2Rec, and the ensemble approach on a validation set, tuning hyper-parameters such as the optimizer algorithm, the learning rate, and the generation strategy. This approach is effective in predicting tracks for a playlist and flexible to include diverse types of inputs, but it is also computationally demanding in the training phase
- …